Adaptive Cluster Ensemble Selection

Authors

  • Javad Azimi
  • Xiaoli Z. Fern
Abstract

Cluster ensembles generate a large number of different clustering solutions and combine them into a more robust and accurate consensus clustering. On forming the ensembles, the literature has suggested that higher diversity among ensemble members produces higher performance gain. In contrast, some studies also indicated that medium diversity leads to the best performing ensembles. Such contradicting observations suggest that different data, with varying characteristics, may require different treatments. We empirically investigate this issue by examining the behavior of cluster ensembles on benchmark data sets. This leads to a novel framework that selects ensemble members for each data set based on its own characteristics. Our framework first generates a diverse set of solutions and combines them into a consensus partition P*. Based on the diversity between the ensemble members and P*, a subset of ensemble members is selected and combined to obtain the final output. We evaluate the proposed method on benchmark data sets and the results show that the proposed method can significantly improve the clustering performance, often by a substantial margin. In some cases, we were able to produce final solutions that significantly outperform even the best ensemble members.
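The select-then-combine pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the consensus function, the diversity measure, or the selection rule, so the co-association consensus, the use of normalized mutual information (NMI) as the similarity to P*, and the "keep the half most similar to P*" rule are all hypothetical choices, as are the names `nmi`, `consensus`, and `select_and_combine`.

```python
import math
from collections import Counter


def nmi(a, b):
    """Normalized mutual information between two label lists (assumed diversity measure)."""
    n = len(a)
    ca, cb, cab = Counter(a), Counter(b), Counter(zip(a, b))
    mi = sum(c / n * math.log(c * n / (ca[x] * cb[y])) for (x, y), c in cab.items())
    ha = -sum(c / n * math.log(c / n) for c in ca.values())
    hb = -sum(c / n * math.log(c / n) for c in cb.values())
    if ha == 0 or hb == 0:
        # Degenerate case: at least one partition puts everything in one cluster.
        return 1.0 if ha == hb else 0.0
    return mi / math.sqrt(ha * hb)


def consensus(partitions, threshold=0.5):
    """Assumed consensus function: link two points if they share a cluster in
    more than `threshold` of the partitions, then return connected components."""
    n, m = len(partitions[0]), len(partitions)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            together = sum(p[i] == p[j] for p in partitions)
            if together / m > threshold:
                parent[find(i)] = find(j)

    # Relabel component roots as 0, 1, 2, ... in order of first appearance.
    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]


def select_and_combine(partitions, keep_fraction=0.5):
    """Build P* from all members, score each member against P*, keep a subset,
    and combine the subset into the final partition."""
    p_star = consensus(partitions)
    scores = [nmi(p, p_star) for p in partitions]
    # Hypothetical selection rule: keep the members most similar to P*.
    order = sorted(range(len(partitions)), key=lambda i: scores[i], reverse=True)
    k = max(1, int(len(partitions) * keep_fraction))
    chosen = [partitions[i] for i in order[:k]]
    return consensus(chosen), p_star, scores


# Toy ensemble: three members agree up to relabeling, one is noisy.
ensemble = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
    [0, 1, 0, 1, 0, 1],
]
final, p_star, scores = select_and_combine(ensemble)
```

In this toy run the noisy fourth member receives a much lower NMI against P* than the other three, so it is excluded before the final combination. A rule that instead favors members far from P* would fit the same interface by reversing the sort order; which rule suits a given data set is the question the paper addresses.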

Similar resources

Wised Semi-Supervised Cluster Ensemble Selection: A New Framework for Selecting and Combing Multiple Partitions Based on Prior knowledge

The Wisdom of Crowds, an innovative theory from social science, claims that the aggregate decisions made by a group will often be better than those of its individual members if the theory's four fundamental criteria are satisfied. This theory has been applied to clustering problems. Previous research showed that this theory can significantly increase the stability and performance of...

Image Segmentation Fusion Using General Ensemble Clustering Methods

A new framework for adapting common ensemble clustering methods to solve the image segmentation combination problem is presented. The framework is applied to the parameter selection problem in image segmentation and compared with supervised parameter learning. We quantitatively evaluate 9 ensemble clustering methods requiring a known number of clusters and 4 with adaptive estimati...

Optimal Cluster Selection Based on Ant Colony Optimization for Cluster Oriented Ensemble Classifier in Stream data classification

In this paper, we propose a method for optimal cluster selection for a cluster-oriented classifier. The cluster-oriented classifier has a great advantage over binary and conventional classifiers, and it works very efficiently on real and sample data. However, the cluster-oriented ensemble classifier faces the problem of selecting the number of clusters for the ensemble. In current fashio...

Hierarchical cluster ensemble selection

Clustering ensemble performance is affected by two main factors: diversity and quality. Selecting a subset of the available ensemble members based on diversity and quality often leads to a more accurate ensemble solution. However, there is no definite relationship between diversity and quality when selecting a subset of ensemble members. This paper proposes the Hierarchical Cluster Ensemble Sel...

A Review of Cluster Based Classification Technique

Fusion and ensembling are important techniques in machine learning. Fusion combines the feature attributes of different classifiers and improves the classification of a binary classifier. Ensemble techniques, in turn, make it possible to merge two individual classifiers and improve the performance of both. The ensemble technique of classifier depends on number of nearer point of classi...

Publication date: 2009